A unified framework for translation and understanding allowing discriminative joint decoding for multilingual speech semantic interpretation
Authors
Abstract
Probabilistic approaches are now widespread in most natural language processing applications, and the choice of a particular approach usually depends on the task at hand. Targeting speech semantic interpretation in a multilingual context, this paper compares the state-of-the-art methods used for machine translation and speech understanding. This comparison motivates our proposal of a unified framework for both tasks based on a discriminative approach. We demonstrate that this framework can be used to perform joint translation-understanding decoding, which combines the translation and semantic tagging scores of a sentence in a single process. A cascade of finite-state transducers is used to compose the translation and understanding hypothesis graphs (1-bests, word graphs or confusion networks). Not only is this proposal competitive with state-of-the-art techniques, but its framework is also more attractive because it can be generalized to other components of human-machine vocal interfaces (e.g. the speech recogniser), allowing a richer transmission of information between them.
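The idea of combining translation and semantic tagging scores in one decoding step can be illustrated with a minimal sketch. It does not reproduce the paper's finite-state transducer cascade: it simply enumerates an n-best list of translations and, for each, the tagger's candidate concept sequences, and keeps the pair with the highest weighted sum of log-scores. All names (`Translation`, `Tagging`, `joint_decode`, `toy_tagger`), the weights, and the scores are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass(frozen=True)
class Translation:
    text: str
    log_score: float  # hypothetical translation model log-score


@dataclass(frozen=True)
class Tagging:
    concepts: Tuple[str, ...]
    log_score: float  # hypothetical semantic tagger log-score


def joint_decode(
    translations: List[Translation],
    tagger: Callable[[str], List[Tagging]],
    w_trans: float = 1.0,
    w_tag: float = 1.0,
) -> Optional[Tuple[float, Translation, Tagging]]:
    """Return the (score, translation, tagging) triple with the best combined score."""
    best = None
    for trans in translations:
        for tagging in tagger(trans.text):
            total = w_trans * trans.log_score + w_tag * tagging.log_score
            if best is None or total > best[0]:
                best = (total, trans, tagging)
    return best


# Made-up scores: the top translation is harder for the tagger to interpret.
TOY_TAGGINGS = {
    "a room in paris": [Tagging(("room", "city"), -2.0)],
    "one bedroom in paris": [Tagging(("room", "city"), -0.5)],
}


def toy_tagger(text: str) -> List[Tagging]:
    # Stand-in for a real semantic tagger: a fixed lookup table.
    return TOY_TAGGINGS.get(text, [])


nbest = [
    Translation("a room in paris", -1.0),
    Translation("one bedroom in paris", -1.2),
]

# Sequential cascade: only the 1-best translation reaches the tagger.
cascade = joint_decode(nbest[:1], toy_tagger)
# Joint decoding: every translation hypothesis competes on the combined score.
joint = joint_decode(nbest, toy_tagger)
```

In this toy setting the cascade is locked into the top-ranked translation, while the joint search can prefer a lower-ranked translation whose tagging score more than compensates. Composing weighted graphs, as the abstract describes, would allow the same search over word graphs or confusion networks without explicit enumeration.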
Similar resources
Discriminative statistical approaches for multilingual speech understanding (Approches statistiques discriminantes pour l'interprétation sémantique multilingue de la parole) [in French]
Statistical approaches are now widespread in the various applications of natural language processing, and the choice of an approach usually depends on the targeted task. This paper presents a comparison between the methods used for machine translation and speech understanding. This comparison allows us to propose a uni...
A Contrastive Analysis of Persian and English Compliment, Request, and Invitation Patterns within the Semantic Metalanguage Framework
Speech acts form the core of pragmatic competence, and their mastery is a prerequisite for successful discourse encounters in an L2. Cross-cultural comparisons, as claimed by Johnstone (2018), are very effective for understanding and acquiring speech acts. Accordingly, the purpose of this study was to compare a limited number of communicative routines in English and Persian within the f...
Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion
We present a discriminative structure prediction model for the letter-to-phoneme task, a crucial step in text-to-speech processing. Our method encompasses three tasks that have previously been handled separately: input segmentation, phoneme prediction, and sequence modeling. The key idea is online discriminative training, which updates parameters according to a comparison of the current system o...
Articles: A Framework for Fast Incremental Interpretation during Speech Decoding
This article describes a framework for incorporating referential semantic information from a world model or ontology directly into a probabilistic language model of the sort commonly used in speech recognition, where it can be probabilistically weighted together with phonological and syntactic factors as an integral part of the decoding process. Introducing world model referents into the decodi...
A Framework for Fast Incremental Interpretation during Speech Decoding
This paper describes a framework for incorporating referential semantic information from a world model or ontology directly into a probabilistic language model of the sort commonly used in speech recognition, where it can be probabilistically weighted together with phonological and syntactic factors as an integral part of the decoding process. Introducing world model referents into the decoding...
Journal title:
- Computer Speech & Language
Volume 35, Issue
Pages -
Publication date 2016